AITopics | optimization procedure

5487e79fa0ccd0b79e5d4a4c8ced005d-Paper.pdf

Neural Information Processing SystemsApr-25-2026, 23:05:35 GMT

artificial intelligence, machine learning, optimization, (18 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.69)
Information Technology > Communications > Networks (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.46)

Add feedback

On UMAP's True Loss Function

Neural Information Processing SystemsApr-25-2026, 07:43:04 GMT

UMAP has supplanted t-SNE as state-of-the-art for visualizing high-dimensional datasets in many disciplines, but the reason for its success is not well understood. In this work, we investigate UMAP's sampling based optimization scheme in detail. We derive UMAP's true loss function in closed form and find that it differs from the published one in a dataset size dependent way. As a consequence, we show that UMAP does not aim to reproduce its theoretically motivated high-dimensional UMAP similarities. Instead, it tries to reproduce similarities that only encode the knearest neighbor graph, thereby challenging the previous understanding of UMAP's effectiveness. Alternatively, we consider the implicit balancing of attraction and repulsion due to the negative sampling to be key to UMAP's success. We corroborate our theoretical findings on toy and single cell RNA sequencing data.

artificial intelligence, machine learning, similarity, (16 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Add feedback

AMP: Automatically Finding Model Parallel Strategies with Heterogeneity Awareness

Neural Information Processing SystemsApr-25-2026, 06:15:25 GMT

Scaling up model sizes can lead to fundamentally new capabilities in many machine learning (ML) tasks. However, training big models requires strong distributed system expertise to carefully design model-parallel execution strategies that suit the model architectures and cluster setups. In this paper, we develop AMP, a framework that automatically derives such strategies. AMP identifies a valid space of model parallelism strategies and efficiently searches the space for high-performed strategies, by leveraging a cost model designed to capture the heterogeneity of the model and cluster specifications. Unlike existing methods, AMP is specifically tailored to support complex models composed of uneven layers and cluster setups with more heterogeneous accelerators and bandwidth. We evaluate AMP on popular models and cluster setups from public clouds and show that AMP returns parallel strategies that match the expert-tuned strategies on typical cluster setups. On heterogeneous clusters or models with heterogeneous architectures, AMP finds strategies with 1.54 and 1.77 higher throughput than state-of-the-art model-parallel systems, respectively.

artificial intelligence, machine learning, optimization problem, (20 more...)

Neural Information Processing Systems

Genre: Research Report (0.48)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

A General Method for Amortizing Variational Filtering

Neural Information Processing SystemsMar-16-2026, 17:57:08 GMT

We introduce the variational filtering EM algorithm, a simple, general-purpose method for performing variational inference in dynamical latent variable models using information from only past and present variables, i.e. filtering. The algorithm is derived from the variational objective in the filtering setting and consists of an optimization procedure at each time step. By performing each inference optimization procedure with an iterative amortized inference model, we obtain a computationally efficient implementation of the algorithm, which we call amortized variational filtering. We present experiments demonstrating that this general-purpose method improves inference performance across several recent deep dynamical latent variable models.

artificial intelligence, machine learning, proceedings, (6 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

5487e79fa0ccd0b79e5d4a4c8ced005d-Paper.pdf

Neural Information Processing SystemsFeb-8-2026, 17:35:10 GMT

algorithm, estimator, optimization, (15 more...)

Neural Information Processing Systems

Country:

Asia > Middle East > Jordan (0.04)
North America > United States > New York (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.69)
Information Technology > Communications > Networks (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.46)

Add feedback

2b4bfa1cebe78d125fefd7ea6ffcfc6d-Paper-Conference.pdf

Neural Information Processing SystemsFeb-8-2026, 02:05:31 GMT

bandwidth, cost model, heterogeneity, (16 more...)

Neural Information Processing Systems

Country: North America > United States > California > Alameda County > Berkeley (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

2de5d16682c3c35007e4e92982f1a2ba-Paper.pdf

Neural Information Processing SystemsFeb-8-2026, 01:46:36 GMT

high-dimensional similarity, similarity, umap, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Europe > Germany > Baden-Württemberg > Karlsruhe Region > Heidelberg (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Add feedback

OptimizingConditionalValue-At-Risk ofBlack-BoxFunctions

Neural Information Processing SystemsFeb-7-2026, 20:13:39 GMT

A wide range of applications from Auto-ML [15] to chemistry [6] and drug design [3] require optimizing ablack-boxobjectivefunction (i.e.,itsclosed-form expression, gradient, andconvexity are unknown) through observing noisy function evaluations.

artificial intelligence, optimization problem, vlt 1, (15 more...)

Neural Information Processing Systems

Country: North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.37)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.47)

Add feedback

186b690e29892f137b4c34cfa40a3a4d-AuthorFeedback.pdf

Neural Information Processing SystemsFeb-7-2026, 15:16:05 GMT

correspond, rebuttal figure 1, regularization, (15 more...)

Neural Information Processing Systems

Genre: Research Report (0.30)

Technology: Information Technology > Artificial Intelligence (0.32)

Add feedback

Gradient-based grand canonical optimization enabled by graph neural networks with fractional atomic existence

Christiansen, Mads-Peter Verner, Hammer, Bjørk

arXiv.org Artificial IntelligenceNov-25-2025

Machine learning interatomic potentials have become an indispensable tool for materials science, enabling the study of larger systems and longer timescales. State-of-the-art models are generally graph neural networks that employ message passing to iteratively update atomic embeddings that are ultimately used for predicting properties. In this work we extend the message passing formalism with the inclusion of a continuous variable that accounts for fractional atomic existence. This allows us to calculate the gradient of the Gibbs free energy with respect to both the Cartesian coordinates of atoms and their existence. Using this we propose a gradient-based grand canonical optimization method and document its capabilities for a Cu(110) surface oxide.

artificial intelligence, existence, machine learning, (18 more...)

arXiv.org Artificial Intelligence

doi: 10.1088/2632-2153/ae1bf6

2507.19438

Genre: Research Report (0.84)

Industry: Materials > Chemicals (0.70)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Filters

Collaborating Authors

optimization procedure

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

5487e79fa0ccd0b79e5d4a4c8ced005d-Paper.pdf

On UMAP's True Loss Function

AMP: Automatically Finding Model Parallel Strategies with Heterogeneity Awareness

A General Method for Amortizing Variational Filtering

5487e79fa0ccd0b79e5d4a4c8ced005d-Paper.pdf

2b4bfa1cebe78d125fefd7ea6ffcfc6d-Paper-Conference.pdf

2de5d16682c3c35007e4e92982f1a2ba-Paper.pdf

OptimizingConditionalValue-At-Risk ofBlack-BoxFunctions

186b690e29892f137b4c34cfa40a3a4d-AuthorFeedback.pdf

Gradient-based grand canonical optimization enabled by graph neural networks with fractional atomic existence